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ABSTRACT 

We present a theoretical framework that can be used to treat 
approximation techniques for very general classes of parameter estimation 
problems involving distributed systems that are either first or second order in 
time. Using the approach developed, one can obtain both convergence and 
stability (continuous dependence of parameter estimates with respect to the 
observations) under very weak regularity and compactness assumptions on the 
set of admissible parameters. This unified theory can be used for many 
problems found in the recent literature and in many cases offers significant 
improvements to existing results. 


*This research was supported in part by the National Science Foundation 
under NSF Grant MCS-8504316, by the Air Force Office of Scientific 
Research under Contract F49620-86-C-01 1, and the National Aeronautics 
and Space Administration under NASA Grant NAG-1-517. Part of this 
research was carried out while the authors were visiting scientists at the 
Institute for Computer Applications in Science and Engineering (ICASE), 
NASA Langley Research Center, Hampton, VA, which is operated under 
NASA Contract No. NAS1-18107. 


i 


- 1 - 


1. Introduction 

An important class of scientific problems related to the control of flexible 
space structures entails the estimation of parameters in distributed or partial 
differential equation models. These inverse or parameter identification 
problems arise in several contexts. A rather obvious class of such problems 
involves the development and identification of physical models for flexible 
structures - e.g., see [BC1], [BC2], [BCR], [BR2], [BPR], [BWIC], [BFW]. In these 
problems one typically investigates models from structural mechanics which 
contain parameters representing elastic properties (such as stiffness, damping) of 
materials. The inverse problems then consist of estimating these parameters 
using data obtained from observations of the system response to dynamic loading 
or displacement. The partial differential equation models usually are second 
order in time, higher order in space (for example, the Euler-Bernoulli or 
Timoshenko beam models and their higher dimensional analogues for plates, etc.). 
The related inverse problems can prove quite challenging, but are extremely 
important as a precursor to and integral part of the design of control strategies 
for space structures. 

A second, less obvious, class of inverse problems that are important in the 
study of space structures involves those arising in material testing, in particular, 
in the area of nondestructive evaluation of materials. These problems, such as 
those related to the detection of structural flaws (broken fibers, delaminations, 
cracks) in fiber reinforced composite materials, have been addressed at NASA 
(for example, in connection with evaluation of the solid rocket motors for the 
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Space Shuttle) using a variety of techniques (acoustic, thermal, etc.). One 
technique [HWW], [BKO] entails the use of thermal diffusivity properties (a 
thermal tomography) to characterize materials. The resulting inverse problems 
involve using boundary observations for the estimation of thermal diffusion 
coefficients and interior boundaries of the domain (i.e. the geometrical 
structure of the system domain) for the heat equation in two or three space 
dimensions. These so-called domain identification problems thus constitute an 
important class of parabolic system identification problems related to space 
structures. 

In this paper we present a general convergence/stability (continuous 
dependence on observations) framework for approximation methods to treat 
parameter identification problems (see [BCK]) involving distributed parameter 
systems. This new parameter estimation convergence framework combines a 
weak version of the system (in terms of sesquilinear forms in the spirit used 
in [BK1]) with the resolvent convergence form of the Trotter-Kato 
approximation theorem [P], [BK2]. The very general convergence results 
depend (in addition, of course, to the usual properties for the associated 
approximating subspaces) on three properties of the parameter (q € Q) 
dependent sesquilinear form <^q)(-,-) describing the system: (A) continuity 
(with respect to the parameter); (B) uniform (in the parameter) coercivitv: and 
(C) uniform (in the parameter) boundedness. The approach permits one to 
give convergence and stability arguments in inverse problems under extremely 
weak compactness assumptions on the admissible parameter spaces Q 
(equivalent to those in typical variational or weak approaches— see [Bl], 
[BCR]) without requiring knowledge of smoothness of solutions usually a part 
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of the variational and general finite element type arguments. Thus this 
approach combines in a single framework the best features of a semigroup 
approximation approach (i.e., the Trotter-Kato theorem) with the best features 
of a variational approach (weak assumptions on Q). The weakening of the 
compactness criteria on Q is of great computational importance since the 
constraints associated with these criteria should be implemented in problems 
(see [BI] for further discussion and examples). 

An additional significant feature of the approach described in this paper 
is that for first order systems it yields directly a stronger convergence (in the 
spatial coordinate) than. one readily obtains without extra effort in either the 
convergence arguments using the usual operator convergence form of the 
Trotter-Kato formulation (e.g., see Theorem 2.2 of [BCK] and Theorem 2.4 of 
[KW]) or the finite element type variational arguments (e.g., see, Theorem 4.1 
of [BR1] and the related remarks in [BKL1]). 

While the approach we present here does involve coercivity of certain 
sesquilinear forms associated with the system dynamics, its applicability is not 
restricted to parabolic systems which generate analytic semigroups. As we 
shall demonstrate in Section 3 below, it can be used to treat problems in which 
the underlying semigroup is not analytic (e.g., those involving Euler-Bernoulli 
equations for beams with Various types of damping-viscous, Kelvin-Voigt, 
spatial hysteresis), improving substantially on some of the currently known 
results for these problems. With appropriate modifications, this theoretical 
framework can be generalized to also allow an elegant treatment of problems 
involving functional partial differential equations, e.g., beams with Boltzmann 
damping (i.e., time hysteresis) -- see [BFW]. 
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2. First Order Systems 

We consider first order systems dependent on parameters q € Q described 
by an abstract equation 


u(t) = A(q)u(t) + F(t,q) 
u(0) = u 0 (q) 


( 2 . 1 ) 


in a Hilbert space H. The admissible parameter space Q is a metric space 
with metric d and for q e Q, we assume that A(q) is the infinitesimal 
generator of a C Q semigroup T(t;q) on H. We assume that we are given 
observations if € H for the mild solution values u(tj,q) of (2.1); i.e., we solve 
(2.1) in the sense 

u(t;q) = T(t;q)u 0 (q) + f T(t-i>;q)F(s,q)ds (2.2) 

J o 

in H. We then consider the least squares identification (ID) problem of 
minimizing over q e Q the functional 


J(q) - f |u(tj;q) - Uj| 2 . (2.3) 

Such problems are, in general, infinite dimensional in both the state u and 
the parameter q and thus one must consider a sequence of computationally 
tractable approximating problems. These can be, for our purposes, best 
described in terms of parameter dependent sesquilinear forms o(q)(-,-) 
associated with (2.1) or (2.2) (i.e., forms which define the operators A(q) in 
(2.1)). For more details and examples in the parabolic case, we refer the 
reader to [BK1]. Briefly, let V and H be Hilbert spaces with V continuously 
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and densely imbedded in H. Denote a family of parameter dependent 
sesquilinear forms by o(q): V x V -* C, q € Q. We assume that a possesses the 
following properties: 

(A) Continuity: For q,q € Q, we have for all 0,0 € V 

Kq)(0,0) - o(q)(0,0)l « d(q,q)|0| v |0| v . 

(B) Coercivitv: There exist > 0 and some real X Q such that for q € Q, 
0 e V we have 

o(q)(0,0) + \,|0& * c^ 2 . 

(C) Boundedness: There exist c 2 > 0 such that for q € Q, 0,0 € V we have 

|o(q)(0,0)| * c 2 |0| v | 41 v - 

Under these assumptions, a defines in the usual manner (e.g., see [K], [S]) 
operators A(q) such that o(q)(0,0) = <— A(q)0,0> H for 0 € dom(A(q)), 0 € V with 
dom(A(q)) dense in V. Furthermore, A(q) is the generator of an analytic 
semigroup T(t;q) on H (indeed, A(q) is sectorial with (XI — A(q))dom(A(q)) = H) 
and mild solutions of (2.1) possess additional regularity (e.g. see Chap. 4 of [P] and 
Chap. 4 of [5]) under appropriate assumptions on F. Property (B) guarantees that 
for X > X Q the resolvent operator R^(A(q)) = (XI - A(q))' 1 exists as a bounded 
operator on H; in fact, it follows readily from (B) that (assuming |-| H $ k|-| v ) we 
have 

Cl |R x (A(q))0|* « <0,R x (A(q))0> H 
4 l0l H |R\(A(q))0l H 
* k 2 M v |R X (A(q))<M v 
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and hence for <l> € V 


k 2 

|R x (A(q))<M v < — I<«v (2- 4 ) 

c i 


while for if) € H we find 

k 2 

|R x (A(q))<H H < —1% ( 2 - 5 > 

c i 

In addition, one can use (B) and (A) to argue that q -» R x (A(q)) is a 
continuous mapping from Q to V. It is these ideas that can be modified to 
give resolvent convergence in the approximation schemes which we introduce 
next. 

We consider Galerkin type approximations in the context of sesquilinear 
forms (e.g., see [BK1] for further details). Let H N be a family of finite 
dimensional subspaces of H satisfying P N z -* z for z € H where P N is the 
orthogonal projection of H onto H N . We further assume that H N C V and that 
the family possesses certain V-approximation properties to be specified below. 
If we now consider the restriction of o(q)(-,-) to H N * H N , we obtain 
operators A N (q): H N -* H N which, because of (B), satisfy a uniform dissipative 
inequality and can be shown to generate semigroups T N (t;q) in H N . These are 
then used to define approximating systems for (2.2): 

u N (t;q) = T N (t;q)P N u 0 (q) + f T N (t-s;q)P N F(s,q)ds. (2.6) 

J o 

One thus obtains a sequence of approximating ID problems consisting of 
minimizing over Q 


J N (q) = I |u N (t.;q) - u.| 2 . 


(2.7) 
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In problems where Q is infinite dimensional (the usual case in many inverse 
problems of interest), on must also make approximations Q M for Q (see [BD] 
[BR2] for details). To include this aspect of the problem in our discussion 
here does not entail any essential mathematical difficulties. Since it would 
increase the notational clutter and provide no new mathematical insight, we 
do not pursue the theory in that generality. 

To obtain convergence and continuous dependence (of parameter estimates 
with respect to observations) results for the solutions q N of minimizing J N in 
(2.7), it suffices under the assumption that (Q,d) is a compact space (see [Bl]) 
to argue: for arbitrary (q N ) C Q with q N -» q we have u N (t;q N ) -» u(t;q) for each 
t. Under reasonable assumptions on F and u 0> . this can be argued if one first 
shows that T N (t;q N )P N z -» T(t;q)z for arbitrary q N -* q and z € H. To do this 
one can use several versions of the Trotter- Kato theorem [P], [BK2], We state 
precisely the "resolvent convergence form" of this theorem in a form that is 
general enough for us to use in several contexts subsequently in this paper. 

Let X and X N , N = 1,2 be Hilbert spaces, X N C X, and let P N : X - X N 

be the orthogonal projection of X onto X N . We assume the X N ’s approximate 
X in the sense that P N x -» x for all x € X. Then we have: 

Theorem 2.1: Let A N and A be infinitesimal generators of C 0 semigroups S N (t) 
and S(t) on X N and X, respectively, satisfying : 

(i) There exist constants u, M such that |S N (t)| $ Me 1 ** for each N; 

(ii) There exists X € p(A) nJJ =1 p(A N ) with Re X > w such that 
R^(4 n )P n x -» R^(^)x for each x € X. 


Then 
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(iii) For each x € X, S N (t)/ >N x -» S(t)x uniformly in t on any compact interval 

[O.tj]. 

We may assume without loss of generality that the constants u and M of 
(i) are chosen so that S(t) also satisfies the bound in (i). 

There is an alternate version (we shall for obvious reasons refer to this 
version as the "operator convergence form") of the Trotter-Kato theorem which 
has been frequently used [BCK], [BC1], [BC2], [R] in parameter estimation 
problems. This version replaces condition (ii) above by the condition: 

(ii’) There exists a set I) dense in X such that for some X, (XI - A)T) is dense 
in X and A N P N x -* Ax for all x € D. 

As we have already noted, it is the resolvent convergence form of this 
theorem which we shall wish to make use of in our theoretical framework 
since, as we shall show, for our first order systems condition (ii) will follow 
readily from (A), (B), (C) and a condition on how H N approximates H. The 
entire theory will require only that Q be compact in the metric d used in the 
continuity statement (A) for the sesquilinear form. This will, in general, be 
much weaker than that compactness required for use in proofs employing 
condition (ii’), since the requirements on "q N -* q" to insure A N (q N )P N z - A(q)z 
typically involve convergence of some derivatives of the q N . For example, in 
first order parabolic equations containing variable coefficients to be 
estimated, this results in a requirement that Q be compact in H 1 whereas use 
of the sesquilinear form approach developed in this paper requires only 
compactness of Q in C or L” (see [Bl] for further discussion of this point). 
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Before turning to our convergence arguments, we state the convergence 
properties required of the approximating subspaces H N alluded to above. 
Throughout our discussions, we shall assume: 

(Cl) For each z € V, there exists z N € H N such that |z - z N | v -* 0 as 
N 

Theorem 2.2: Let conditions (A), (B), (C) and (Cl) hold and q N -* q in Q. Then 
for X = X Q , R^(A N (q N ))P N z -* R^(A(q))z in the V norm for any z € H. 

Proof: First note that since A N (q N ) results from the restriction of a to H N x 
H n , we can choose X = X Q in (B) so that X € p( A(q)) r£ =1 p(A N (q N )) with X > w 
(and indeed bounds similar to those of (2.4), (2.5) hold for R^(A N (q N ))P N ). 

Let z e H be arbitrary and for notational convenience put w = w(q) £ 
R\(A(q))z and w N = w N (q N ) s R^(A N (q N ))P N z. Since w € dom(A(q)) C V, we 
may use condition (Cl) to define a sequence (w N ) satisfying |w N — w| v -* 0 as 
N -» ® . 

We wish to show |w N - w| v -» 0. If we argue that |w N - w N | y -* 0, the 
desired results follow immediately from the triangle inequality. 

Let z N = w N - w N so that z N € H N C V. Using <*,• > to denote the inner 
product in H, we find 

o(q)(w,z N ) = <— A(q)w,z N > = <(X — A(q))w,z N > — X<w,z N > 

= <z,z N > — X<w,z N > 


and 
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I 




o(q N )(w N ,z N ) = <— A N (q N )w N ,z N > = <(X - A N (q N ))w N ,z N > — X<w N ,z N > 

= <P N z,z N > - \<w N ,z N > 

= <Z,Z N > — X<W N ,Z N >. 

Hence we have 

o(q)(w,z N ) = o(q N )(w N ,z N ) + X<w N — w,z N >. 

Using this with (B) we obtain 

Cj|z N |^ « o(q N )(z N ,z N ) + X|z N | 2 = o(q N )(w N ,z N ) - o(q N )(w N ,z N ) + X|z N |^ 
= o(q)(w,z N ) + X<w-w N ,z N > - o(q N )(w N ,z N ) + X|z N |^ 

= o(q)(w,z N ) - o(q N )(w,z N ) + o(q N )(w-w N ,z N ) 

+ X(<w-w N ,z N > + |z N |g). 

If we use (A) and (C), we thus have 

c^z* 1 !^ « d(q,q N )|w| v |z N | v + c 2 |w-w N | y |z N | y + X(<w-w N ,z N > + |z N | 2 ). 

Finally, since 

<w-w N ,z N > = <w— w N ,z N > + <w N — w N ,z N > = <w-w N ,z N > — <z N ,z N >, 

the above inequality may be written 

Cjlz^ly * d(q,q N )|w| y |z N | y + c 2 |w - w N | y |z N | y + x<w - w N ,z N > 

< d(q,q N )|w| v |z N | y + c 2 |w - w N | y |z N | y + |X||w - w N | H |z N | H . 

Thus, using |-| H $ k|-| y we find 


c x |z N | v $ d(q,q N )|w| v + (c 2 + |X|k 2 )|w - w N | y , 
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which, from the definition of {w N } and the fact that q N -» q, yields the 
desired convergence z N = w N - w N -* 0 in V norm. 

Since V norm convergence is stronger than H-norm convergence, we can 
use the results of this theorem along with Theorem 2.1 for X = H, X N = H N , 
p N = p N , = A N (q N ), A = A(q) to obtain immediately that T N (t;q N )P N z *♦ 
T(t;q)z for z € H, the convergence being in the H norm. This along with (2.2) 
and (2.6) can be used to argue the desired convergence u N (t;q N ) -• u(t;q) in the 
H norm (assuming, of course, appropriate smoothness of F and u 0 in q € Q). 
However, with no additional assumptions and only a little extra effort, one 
can obtain this convergence in the V norm. As we shall explain below, this 
stronger convergence is often immensely useful in parameter estimation 
problems where the observations may be continuous in the V norm, but not in 
the H norm. 

Theorem 2.3: Under the hypotheses of Theorem 2.2, we have for each z € H, 
T N (t;q N )P N z -» T(t;q)z in the V norm for t > 0, uniformly in t on compact 

subintervals. 

Proof: To establish these results, we first apply the Trotter-Kato theorem in 
the space X = V with X N = H N . Let P*: V -* H N denote the orthogonal 
projection of V onto H N in the V inner product. Then hypothesis (Cl) implies 
Pyf -» f in the V norm for any f € V as well as P N f -* f in the H norm for 
any f € H. 

To carry out our arguments, we shall employ several bounds for resolvents 
similar to but sharper than those of (2.4), (2.5). These bounds follow from 
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results in Tanabe (c.f. Lemma 6.1 of Chapter 3 in [T]); we note that our 
operator A can be extended to A: V -* V* as in [T] so that the results given 
there are applicable as used below. (In [T], the condition (B) is assumed to 
hold with X 0 = 0 and we, without loss of generality, state our bounds for this 
case.) The first bound we shall use yields for f e V 

|R x (A N (q N »f| v « ^ 75 - |f| H (2.8) 

where c is independent of N. Using this with f = P^z - P N z for z € V along 
with the results of Theorem 2.2 we obtain R x (A N (q N ))PyZ -* R x (A(q))z in V 
norm for any z € V. This is (ii) of Theorem 2.1 with X = V, X N = H N and 

P N = Py. 

To obtain the uniform stability bound (i) of Theorem 2.1 in the V-norm, 
we make use of another bound that follows from [T]. From the last bound of 
Lemma 6.1, Chap. 3 of [T] one may readily argue that for f e V 

M 

|R x (A N (q N ))f|y $ |^-|f|y (2.9) 

where is independent of N. The arguments behind (2.9) involve using 
(3.51) of [T] for A* and observing that for 7 € V, v € V* arbitrary so that 
R X (A)7 € V = V** we have (using the usual notation for the duality product) 


(R x (A)7,v) v *« y* — (v,R x (A)7)y*^y — 

$ |R X (A*)v|y*|7|y ^ 


(R X (A*)V,7)y. y 

M. 

|\| l7l vl v lv- 


M. 

Then |R X (A)7| V „ 4 ^“iTly 


and hence, identifying V and V**, we obtain 


IRlIAMv < 


|7| v for 1 e V. 
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We may use the bound (2.9) in the identity 

T N (t;q N ) = | e Xt R x (A N (q N ))d\, (2.10) 

where r is a contour (about X Q ) similar to that given in [P, p. 63], to argue for 
z € H n 

|T N (t;q N )z| v « Me X °‘|z| v (2.11) 

where M is independent of N. Thus one obtains |T N (t;q N )| v $ MeV. (These 
results are essentially given in [T]; one just needs to check the arguments to 
ascertain that the constant M depends on A N only through the bounds c x and c 2 
of conditions (B) and (C) and the sector angle 8 for A N -- where 8 again 
depends only on c 2 and c 2 .) 

We may then apply Theorem 2.1 as indicated to obtain T N (t;q N )PyZ *♦ 
T(t;q)z in V norm for every z € V. It remains to argue that T N (t;q N )P N z -* 
T(t;q)z in V norm for every z € H. 

If we use the bound (2.8) in (2.10), arguments similar to those used to 
establish (2.11) can be made to give 

|T N (t;q N )z| v « Me^V^lzIn, t > 0, z € H N , (2.12) 

where again M is independent of N. 

Recalling that V is dense in H, then given z e H and e > 0, we may 
choose z v e V such that |z y - z| H < e. Then noting that T(t;q)z € V for t > 0, 


we have 
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|T N (t;q N )P N z - T(t;q)z| v S |T N (t;q N )(P N z - P"z y )| y 

+ |T N (t;q N )P y z y - T(t;q)z v | y + |T(t;q)(z y - z)| y . 

From (2.12) wc see that the first term in the right side of this inequality is 
bounded by Me^o t t' 1 ^ 2 |P N z - Pv z vIh which, as N -» ~ approaches 

Me^o 4 ! -1 ^ 2 ^ — Zyfjj. The third term is also bounded by a similar expression, 
while the second term approaches zero as N -* 09 by our previous arguments. 
We thus have established the convergence claimed in the statement of Theorem 
2.3. 


Among the examples that can be treated immediately with the above 
theory are the usual parabolic systems (see [BK1], [L]). To illustrate these 
ideas, consider the estimation problem for the standard Dirichlet boundary 
value problem for one-dimensional parabolic systems. That is. 


9u 

dt 



+ ^(q 2 u) + q 3 u on 
u(t,0) = u(t,l) = 0, 


ft = (0,1) 


with q = (q r q 2 ,q 3 ) to be chosen (via a least squares criterion) from Q, a 
compact subset of C(ft) * C(fl) x C(fl). The state spaces are H = H°(fl), V = 
Hp(ft) and the weak form of the equation is given by 

<z(t),0> + o(q)(z(t),<W = 0, 4 > e V 

where the sesquilinear form is defined as 


o(q)(<P,<W = <q x D<p,D^> + <q 2 <p,D</»> - <q 3 <p,0>. 
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with D = d/dx. One can readily verify that conditions (A) and (C) hold, 
while the coercivity condition (B) is valid if we assume elements of Q satisfy 
q x (x) > v > 0 for some constant v. 

In this case one can specify the domain of A(q) by 

dom A(q) = {<p € Hq(O) | Ql D<p + q 2<P e H 1 ^)} 

and A(q)<p = DfajDcp) + D(q 2 <p) + q 3 <p. For approximating elements one can use 
either piecewise cubic or piecewise linear B-splines modified to satisfy the 
Dirichlet boundary conditions. 

We note that the theory above only requires compactness of Q in the 
[C(ft)] 3 topology along with relatively weak smoothness assumptions when 
compared with other approaches (see [BK3], [BK.L1], [BKL2], [BCK]). 
Furthermore we obtain V = H^fl) convergence of the states. Hence the theory 
is complete for least squares criterion involving pointwise (in x) observations 
of the state. This is obtained for little extra effort when compared with the 
efforts usually required to obtain the stronger convergence (see [BCK], 
[BKL1]). 

Other parabolic examples of great practical importance can be readily 
treated in the context of this framework. For example, the 2-D thermal 
tomography problem mentioned in the Introduction and discussed in [BKO] 
results in a boundary identification problem for parabolic systems. With a 
standard transformation these problems can be readily treated theoretically and 
computationally with the approach of this paper. Details can be found in 
[BKO], 

Another class of problems of interest involves estimation of coefficients 
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in the Fokker-Planck or forward Kolmogorov equations (see [B2]) for the case 
where the Markov transition process for growth is time invariant in size/age 
structured population models. In this case the equations have the form 

du a a 2 

aT + a^ QlU) = aJ (q ’ u) “ q * u ’ x ° <x< x i* 

with boundary conditions 

r 9 l x=x o f x i 

(q 2 u)J =j q 4 U)u(t,5)<H 

x o 

r a i x=x i 

^--(q.u)] =0. 

The associated sesquilinear form is given by (here H = V = 

H^Xq.Xj)) 

<K q)(<p» = “<q 1 «P - D(q 2 <p),D0> - <Kx 0 )R(q 4 )(<p) + <q 3 <p,0> 

with R(q 4 )(<p) = J* 1 Q 4 (09(5)<H. Under appropriate assumptions on Q C L«(ft) 

X wL^(ft) x L^ft) x L^ft), ft = (x^Xj), (see [B2] for details) one can readily argue 
that hypotheses (A), (B), (C) hold. Thus these problems also fall within the 
purview of the theory developed here. 
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3. Second Order Systems 

The ideas developed in the previous section can be applied to parameter 
estimation problems involving second order systems of the form 

u(t) + B(q)u(t) + A(q)u(t) = f(t) (3.1) 

in a Hilbert space H where the operators A(q) and B(q) are defined via 
parameter dependent sesquilinear forms in a manner similar to that of Section 
2. We use the general approach given in [S]; we again assume we are given a 
Hilbert space V C H that is continuously and densely imbedded in H. Let 
a x(Q): V x V -» <E be a symmetric sesquilinear form satisfying conditions (A) 
and (C) of Section 2 and the V coercivity condition (B) with \ Q = 0. Then for 
each q e Q we may define continuous linear operators A(q): V -» V* given by 

^(QXM) - (A(q)M) y , y (3.2) 

for € V, where as customary, we have identified the pivot space H with its 
dual H*, i.e., V C H = H* C V* and the duality product (-,-) v * v * s t ^ ie unique 
extension by continuity of the scalar product of H from H x V to 

V* x v (e.g., see [S] for details). Of course, as in Section 2, we may also view 
A(q) as a densely defined operator in H where o t ( q)(ip,i/>) = <A(q)<p,</>> H for 
<p € dom A(q), 4> € V. The sesquilinear form and its associated operator A 
will correspond to the "stiffness" operator in the examples we treat below. 

A second sesquilinear form will give rise to the "damping" term in our 
examples. We assume we are given* a (not necessarily symmetric) sesquilinear 
form o 2 (q): V x V -» C satisfying conditions (A) and (C) and the semicoercivitv 
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condition 

(B 1 ) H-semicocrcivitv: There exists b > 0 such that for q € Q and </> e V 
we have 

o 2 ( qKM) * b\<k 2 H . (3.3) 

As before, under condition (C) on a y we may define a continuous linear 
operator B(q): V ■* V* given by 

o 2 ( q)(M) = (B(q )<t>,4>) v . y . (3.4) 

Again, we may alternatively view B(q) as being densely defined in H. 
However, since we may wish in general to view equation (3.1) as an equation 
in V* (again following classical formulations [S], [L], [T]> we shall interpret 
A(q) and B(q) as members of £(V,V*). We note that from conditions (B) and 
(C) on o r the form o^q) is, for each q € Q, equivalent to the norm inner 
product in V; indeed,' we can use o 2 (q) to define a parameter dependent 
equivalent inner product in V. We shall use V q to denote the Hilbert space 
consisting of the elements of V equipped with this inner product. 

We shall rewrite equation (3.1) in first order form and to that end we 
define the product spaces V = V x V and K = V x H. If <-,-> v denotes the 
inner product in V, we may use a x and o 2 to define a sesquilinear form 
a : V x V •* d given by 

°(q)((u,v),(<k</0) = + o^qXu,^) + o 2 (q)(v,t/>). (3.5) 

We may then write equation (3.1) in weak or variational form as 


<w(t),X> K + <*q)(w(t),x) = <F(t),X>}f » X e V, 


(3.6) 
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where w(t) = (u(t),v(t)), x = (0,0), F(t) = (f(t),0) and <•,• ># is the usual product 
space inner product. Alternatively, we may write equation (3.1) as 

w(t) = -4(q)w(t) + F(t) (3.7) 


where „4(q) is the operator associated with the sesquilinear form a in the usual 
manner. 

Following this approach, we define in If = V x H the operator 


^(q) = 


’ 0 
- -A(q) 


I 

-B(q) . 


(3.8) 


on dom ^(q) = {(0,0) € Jf | 0 € V and A(q)0 + B(q)0 € H) C V. We can readily argue 
that dom A(q) is dense in K = V x H and that, under the conditions on Oj and o 2 , for 
X > 0 the range of XI — ,4(q) is If (see the arguments given below). Moreover, j4(q) 
is dissipative in Jf q s V q x H since for (0,0) € dom A(q) 

<^(q)(0,0),(0,0)>K = cr!(q)(0,0) - a x (q)(0,0) - a 2 (q)(0,0) 

<1 

4 - b|0|* * 0. 

Thus by the Lumer-Phillips theorem (e.g. see Chap. 1, Thms. 4.3, 4.6 of [P]) we find 
that /4(q) generates a semigroup of contractions on x H. Since V q and V possess 
equivalent (uniformly in q € Q) norms, we have that /4(q) is the infinitesimal 
generator of a C 0 -semigroup T(t;q) on Jf = V x H. If the form o 2 satisfies (3.3) with 
b > 0, then one can show that this semigroup is uniformly exponentially stable, 
i.e., lT(t;q)l S Me^ for some M » 1 and u > 0. Furthermore, if the H 
semicoercivity of (3.3) is replaced by V coercivity, i.e., b > 0 and the H norm is 
replaced by the V norm in (3.3), then T(t;q) is an analytic semigroup on If = V x H. 
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Since we have o(q)(x,0 = -C4(q)X,5 >jf for x € dom ^(q) and ( € V « 
V x V, one might be tempted to apply the arguments and results of Section 2 
to this sesquilinear form with V and K playing the role of the spaces V and H 
in the theory of Section 2 (i.e., treat this system generated by ^(q) as a first 
order system in K and apply without modification the theory in Section 2). If 
one did this, then conditions (A), (B) and (C) must be satisfied by o(q) using 
the norms of K and V. Conditions (A) and (C) pose no difficulty under the 
assumptions of (A) and (C) on o x and o 2 given above. However to argue that 
o(q) is V coercive (condition (B) of Section 2), one finds that the H 
semicoercivity condition (3.3) on o 2 (q) must be strengthened to V coercivity. 
While some damping forms of interest (e.g. strong Kelvin-Voigt damping — we 
discuss this below) do satisfy this stronger condition, several important types 
of damping lead to sesquilinear forms that don’t. Hence we shall modify the 
arguments behind Theorem 2.2 in order to enable us to treat the more general 
cases. 

We shall throughout our discussions henceforth assume that a x satisfies 
conditions (A), (B) with X Q = 0, and (C), while o 2 satisfies conditions (A), (B '), (C). 
Again we shall be interested in a resolvent convergence form of the Trotter-Kato 
theorem and hence must consider the resolvent of the operator A(q) given in 
(3.8). 

To motivate our discussions, consider for X > 0 the equation in K given by 
X = RyC4(q)K for X = (<p,<W, 5 = (0,7). This is equivalent to solving for x € 
dom(/4(q)) in the equation (XI - ^4(q))x = t which may be written 



- 21 - 


X<p — 0 = rj 


(3-9) 


\<l> + A(q)<p + B(q)t/> = y . 

If we substitute the first equation <l> = X<p - n of this system into the second 
we have 


X 2 <p + A(q)<p + XB(q)<p = y + Xn + B(q)n (3.10) 

which must be solved for <p € V. This suggests that we define, for X > 0, the 
associated sesquilinear form o^(q): V x V -* d given by 

o X (q)(<P,0 - x2 <<P.C> h + Oi(q)(«P,0 + Xa 2 (q)(«p,0. (3.11) 

Since Oj is V-coercive and o 2 is H-semicoercive, we find for <p € V 

°x(q)(<p><p) ■ x 2 Hh + 0 i(q)(«p,<p) + Xci 2 (q)(<p,<p) 

> x 2 I<pIh + + xbM*. 

Hence for <p € V 

0xta)(<p»<p) * c iWv + M*pIh > c il < Ply (3.12) 

where X = (X 2 + Xb) > 0. Thus, for X > 0, the form is V-coercive and the 
equation (3.10) is solvable for <p € V for any given 5 = ( 0 , 7 ) in If = V x H. It follows 
then that defining <l> using the first equation in (3.9), we may, for any 5 = (n, 7 ), 
find an element x = (<p,<P) in dom(/4(q)) which solves (3.9); i.e. for X > 0, R^(/<(q)) 
exists as an element in E(K). 

The coercive inequality (3.12) will be the basis of our convergence 


arguments. 
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Wc consider next a Galerkin type approximation scheme for equation (3.7). 
As in Section 2, we assume approximation subspaces H N C V satisfying 
condition (Cl). Then for any w € V = V x V and each N, there exists w €• 
H n x H n satisfying as N -* ® 

|w N — V/\y — 0. (3.13) 

To define the approximating systems, we consider o(q) defined in (3.5) 
restricted to Jf N = H N x H N and obtain in the usual manner the operators 
/l N (q): K n -» K n . Since (3.12) holds, we have immediately that R x (/4 N (q)) exists 
in 2!(Jf N ) for X > 0. Denoting by P N the orthogonal projection of If onto lf N , we 
may then prove the following convergence results which is analogous to that 
of Theorem 2.2 for first order systems. 

Theorem 3.1. Suppose that conditions (A), (B) with X Q = 0, (C) for o v (A), (B'), 
(C) for o 2 and (Cl) hold and let q N -* q in Q. Then for X > 0 we have 
R x 04 N (q N ))P N $ ^ R x 04(q))5 in the V norm far any 5 € If . 

Proof. Let 5 € If be arbitrary and put w = w(q) = R x (^(q))t, w N = w N (q N ) = 
R x (,4 N (q N ))P N L Let w = (<p,4>), 5 = (h,7) so that equation (3.9) is satisfied by 
this pair. Also, letting w N = Op 1 *,^), P N S = (n N ,7 N ), we have that this pair 
satisfies (3.9) with q = q N . In particular, using (3.10) and (3.11) we find for 


5 € V 


o x (q)(<p,5) = <7,5 > H + K*U>h + SfaX'U) 
o x (q N )(<P N ,0 - <7 n ,5> h + + ®j(Q N Xn N ,C). 
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Since w = w(q) € dom(j4(q)) C V, we may choose w N = (^ N ,0 N ) in K N 
satisfying (3.13). Thus, choosing £ = 5 N = <p N — <p N in H N C V, we may use the 
above equalities to obtain 

o x (q)(<P>C N ) " °\(q N )(<P N .£ N ) = <7”7 n »5 n > h + X<n-n N ,C N > H 

(3.14) 

+ o 2 (q)(n,^ N ) - o 2 (q N )(rj I 'U N ). 

We use this to argue that £ N -* 0 in V. 

Applying the estimate (3.12) to the element £ N = <p N — <p N and then using 
(3.14) we find 

c^X + * ®x( t » H Kc N .i: N ) (3.i5) 

= o x (q N )(<p N ,C N ) - o x (q)(<p,? N ) + o x (q)(<p,5 N ) - a x (q N )(v N ,5 N ) 

= <7 n -7,C n >h + x<n N -n,c N > H + o 2 (q)(n,c N ) + o 2 (q N )(n N ,c N ) 

+ o x (q)(<p,C N ) - a x (q N )(v N ,5 N ). 

The last four terms on the right side of this equality may be written as 

o 2 (q)(h N -fU N ) - o 2 (q)( 0 N ,C N ) + o 2 (q N )(r? N ,? N ) 

A . (3-16) 

+ a X (q)(9-tp N ,C N ) + ° x (q)(9 N »£ N ) - o x ( q N )(; N ,5 N ). 

Using the definition (3.11) of o x and the boundedness and continuity in q of 
°i» a 2 (condition (C) and (A)) along with |-| H $ k|-| v we may bound from above 
the expression in (3.16) by 

c 2 |n N -n| v K N | v + d(q,q N )|n N | v K N | v + C|<p-^ N | V K N | V 
+ (l+X)d(q,q N )|i N | v |C N | v . 


(3.17) 
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whcre C = X 2 k 2 + c 2 + Xc 2 . We observe that since P N is the orthogonal 
projection of K onto K N , hypothesis (Cl) implies that (n N ,y N ) = P N (o,y) -» ( 17 , 7 ) 
in K = V x H. Thus, |h N | v $ K for some constant K. Since <p N -* <p in V, we 

Aw 

may also assume |<p”| v i K. 

Recalling that X > 0, we can combine (3.15), (3.16), (3.17) and the above 
observations to obtain the estimate 

Cl |C N | v < k|7 N -7l H + (Xk + c 2 )|n N -r)[ v + K(2+X)d(q,q N ) + C|<p^ N | v , 

and thus £ N -* 0 in V. That is, <p N — <p N -* 0 in V and hence <p N -» <p in V. 

Recalling that (<p N ,<l> N ), (n N ,7 N ) and (%<!>), (h,7) satisfy (3.9) with q N and q, 
respectively, we have that 

ilF = X<p* — n N and <p = X<p — n. 

Thus we find immediately that ** 4> in V. This completes the proof that 
w N =.(<p N ,^ N ) ■* w = (<p ,ip) in the V norm. 

Let T N (t;q N ) denote the C 0 semigroup generated by >4 N (q N ) in K N . Then 
applying Theorem 2.1 with X = K , X N = lf N and A N = /4 N (q N ), we obtain 
immediately from the above discussions the following desired convergence 
results. 

Theorem 3.2. Under the hypotheses of Theorem 3.1, we have for each { € K and 
t > 0, T N (t;q N )P N 5 -* T(t;q)( in K, with the convergence being uniform in t on 
compact subintervals. 


The needed convergence results for parameter estimation problems follows 
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directly from this theorem. In particular, for solutions of (3.7) and their 
approximations we have (assuming, of course, appropriate smoothness in q of 
initial data and the perturbation function F) 

u N (t;q N ) -* u(t;q) in V norm, 
v N (t;q N ) -» v(t;q) in H norm. 

To illustrate the types of problems that can be investigated in the context of 
the theoretical framework developed in this section, we consider a cantilevered 
Euler-Bernoulli beam of unit length and unit linear mass density. The transverse 
vibrations can be described by an equation of the form (3.1) with the operators 
A(q) and B(q) chosen appropriately (see e.g. [BCR], [BR2], [R] for detailed 
equations). We assume that the end at x ■ 0 is fixed, the end atx=lisfree. Then 
the state spaces can be chosen as H = H°(0,1) and V - h£( 0,1) - (<p € H 2 (0,1) | <p(0) - 
D<p(0) - 0). The stiffness sesquilinear form is given by 

OtCqX**) - f EID 2 <pD 2 0 
J o 

where we assume the stiffness coefficient q x (x) = EI(x) satisfies q x (x) > a > 0 
and is in L„(0,1). The operator A(q) in (3.1) is given by A(q)cp = D 2 (EID 2 «p) 
with 

dom A(q) = (<|> € V | EID 2 <f> € H 2 (0,1), EID 2 <p(l) = D(EID 2 <p)(l) = 0}. 

If we denote by q 2 the damping parameter to be discussed in the following 
examples, then the admissible parameter set Q can be taken as a compact 


subset of 
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Q = {q = (q r q 2 ) I Q t € LJO.l), q 2 (x) > a > 0}. 

With this formulation, a number of important damping mechanisms can be 
readily treated. 

Viscous damning: In this case air or fluid damping is usually assumed 

proportional to the velocity of displacement so that the term B(q)u(t) in (3.1) 
has the form bu(t) for q 2 (x) = b(x). The damping sesquilinear form o 2 is given 
by 

o 2 ( q)(<p.<M = <b«p,0> H 

which, for b(x) > 0, satisfies the H-semicoercivity condition (B') but of course 
is not V-coercive. The domain of the operator in (3.8) is given by dom .4(q) == 
dom A(q) x V. 

Kelvin-Voict damping: For these models the damping moment is postulated as 
being proportional to the strain rate and hence the damping term has the form 
D 2 (c D ID 2 u(t)) where q 2 (x) = c D I(x). The associated sesquilinear form is given 
by 

o 2 (q)(<P,</0 = <c d IDVD 2 *> h 

which for c D I(x) £ B > 0 satisfies a V-coercive condition and hence of course 
the H-semicoercivity of hypothesis (B 1 ). Letting M = EID 2 <p + c D ID 2 t|> denote 
the "moment" of the beam, we have the domain of /4(q) given by 

dom A( q) = {(<P,<0 € V x H | <l> € V, M € H 2 (0,1), M(l) = DM(1) = 0}. 

If we only have c D I(x) » 0, then only condition (B 1 ) is satisfied but our 
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theoretical framework is still applicable. We note that results of the 
framework then provide a distinct improvement over the results for the 
problems given in [BCR] (where smoothness assumptions on solutions must be 
hypothesized as well as q 2 > 0) and [KG] (where a stronger and less readily 
characterized topology than that of C(0,1) or L^O,!) must be used for the 
compactness assumption on Q). 


Spatial hysteresis damping: This damping model, which has been suggested 

recently and investigated by Russell in [RU], is based on physical foundations 
that appear particularly appropriate for fiber reinforced composite beams, 
e.g., beams of composite materials in which strands of fibers are lengthwise 
bound in an epoxy matrix. The damping term has the form B(q)u(t) = 
D((G(q) — vI)Du(t)) where G(q) is a compact operator in H°(0,1) defined by a 
symmetric kernel q 2 (xjr) = b(x,y) = b(y,x) > 0 a? 


(G<p)(x) = [ b(x,y)<p(y)dy, <p € H°(0,1) 

' J ft 


with b € H°((0,1) x (0,1)) and v(x) s Jjb(x,y)dy. The associated sesquilinear 
form is given by 


o 2 ( q)(q>,0) = <(vl - G)Dcp,Dt/)> H 

which is readily seen to be H-semicoercive but not V-coercive. The generator 
(3.8) for the semigroup has domain 

dom i4(q) = {((p,0) € V x H | (1 e V, EID 2 «p(l) - 0, 

[D(EID 2 <p) + (vl - G)Di/»](l) = 0). 
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Time hysteresis damping: These models, referred to as Boltzmann damping 
models, have been widely studied in recent years (see e.g. [F], [BF], [HW] and 
the references therein) in connection with flexible structures. In an Euler- 
Bernoulli beam as formulated above, the damping term B(q)u(t) can be 
replaced by a term that has the form 

-D 2 J g(s)D 2 u(t+s, -)ds 

J -r 

where q 2 = g(s) = ae&*/j-s, cc,8 > 0. This means that equation (3.1) becomes a 
functional-partial differential equation or partial differential equation with 
delay or hereditary term. Thus the above theoretical framework is not 
directly applicable. However, as is shown in [BFW], the framework developed 
in this section can be appropriately modified and extended to give a succinct 
theoretical treatment of approximation methods involving estimate 
convergence and continuous dependence on observations for these models also. 

We have used methods based on the theoretical ideas in this section to 
successfully estimate damping in a number of flexible structure experiments. 
To date we have studied viscous, Kelvin-Voigt, and Boltzmann damping in 
vibrations of composite beams [BWIC], [BFW]. 

Finally, we note that the ideas presented in this paper can be extended to 
provide a framework for the treatment of nonlinear distributed parameter 
systems. The coercivity conditions are replaced by monotonicity assumptions 
and, of course, nonlinear semigroups (evolution systems) and a nonlinear 
Trotter-Kato approximation theorem play fundamental roles in development of 
this theory. Details can be found in a forthcoming manuscript [BRR]. 
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